Detecting click fraud in online advertising: a data mining approach

Authors

  • Richard Jayadi Oentaryo
  • Ee-Peng Lim
  • Michael Finegold
  • David Lo
  • Feida Zhu
  • Clifton Phua
  • Eng-Yeow Cheu
  • Ghim-Eng Yap
  • Kelvin Sim
  • Minh Nhut Nguyen
  • Kasun S. Perera
  • Bijay Neupane
  • Mustafa Amir Faisal
  • Zeyar Aung
  • Wei Lee Woon
  • Wei Chen
  • Dhaval Patel
  • Daniel Berrar
Abstract

Click fraud, the deliberate clicking on advertisements with no real interest in the product or service offered, is one of the most daunting problems in online advertising. Building an effective fraud detection method is thus pivotal for online advertising businesses. We organized a Fraud Detection in Mobile Advertising (FDMA) 2012 Competition, opening the opportunity for participants to work on real-world fraud data from BuzzCity Pte. Ltd., a global mobile advertising company based in Singapore. In particular, the task is to identify fraudulent publishers who generate illegitimate clicks, and distinguish them from normal publishers. The competition was held from September 1 to September 30, 2012, attracting 127 teams from more than 15 countries. The mobile advertising data are unique and complex, involving heterogeneous information, noisy patterns with missing values, and a highly imbalanced class distribution. The competition results provide a comprehensive study on the usability of data mining-based fraud detection approaches in a practical setting. Our principal findings are that features derived from fine-grained time-series analysis are crucial for accurate fraud detection, and that ensemble methods offer promising solutions to highly imbalanced nonlinear classification tasks with mixed variable types and noisy/missing patterns. The competition data remain available for further studies at http://palanteer.sis.smu.edu.sg/fdma2012/.

©2013 Richard Oentaryo, Ee-Peng Lim, Michael Finegold, David Lo, Feida Zhu, Clifton Phua, Eng-Yeow Cheu, Ghim-Eng Yap, Kelvin Sim, Minh Nhut Nguyen, Kasun Perera, Bijay Neupane, Mustafa Faisal, Zeyar Aung, Wei Lee Woon, Wei Chen, Dhaval Patel, and Daniel Berrar.

Similar resources

A New Memory Efficient Technique for Fraud Detection in Web Advertising Networks

The advertising network considered as the middle man in web advertising between advertisers and publishers. This paper presented an intelligent and memory efficient Fraud detection technique with intelligent classification engine to be used by the advertising networks to scan clicks and impressions offline streams happen on publisher side for the purpose of detecting click fraud and impression ...

Combating online fraud attacks in mobile-based advertising

Smartphone advertisement is increasingly used among many applications and allows developers to obtain revenue through in-app advertising. Our study aims at identifying potential security risks of mobile-based advertising services where advertisers are charged for their advertisements on mobile applications. In the Android platform, we particularly implement bot programs that can massively gener...

Whose Click Fraud Data Do You Trust? Effect Of Click Fraud On Advertiser's Trust And Sponsored Search Advertising Decisions

Online sponsored search has emerged as a dominant business model for majority of search engines and as a popular advertising mechanism for online retailers. However, sponsored search advertising is being negatively impacted by click fraud which involves the intentional clicking on sponsored links with the purpose of gaining undue monetary returns for the search engine or harming a particular ad...

The Study on Supervision Model for Online Advertising Click Fraud

Considering the click fraud in the online advertising market, a basic game theoretic model for click fraud is built firstly. In this model, the Ads Network can choose to make click fraud supervision or trust, and advertising publishers can choose to publish advertisement honestly or to cheat. In this paper, we get the result of the mixed strategy Nash equilibrium solution firstly and then we ex...

Journal:
  • Journal of Machine Learning Research

Volume 15

Publication date: 2014